Durao ef al. Progress in Orthodontics 2013, 14:31 
http://www.progressinorthodontics.eom/content/1 4/1 /3 1 



O Progress in Orthodontics 

a SpringerOpen Journal 



REVIEW Open Access 



Validity of 2D lateral cephalometry in 
orthodontics: a systennatic review 

Ana R Durao^", Pisha Pittayapat^, Maria Ivete B Rockenbach^ Raphael Olszewski"^, Suk Ng^ Afonso P Ferreira^ 
and Reinhilde Jacobs^ 



Abstract 

Lateral cephalometric radiography is commonly used as a standard tool in orthodontic assessment and treatment 
planning. The aim of this study was to evaluate the available scientific literature and existing evidence for the 
validation of using lateral cephalometric imaging for orthodontic treatment planning. The secondary objective was 
to determine the accuracy and reliability of this technique. We did not attempt to evaluate the value of this 
radiographic technique for other purposes. A literature search was performed using specific keywords on electronic 
databases: Ovid MEDLINE, Scopus and Web of Science. Two reviewers selected relevant articles, corresponding to 
predetermined inclusion criteria. The electronic search was followed by a hand search of the reference lists of 
relevant papers. Two reviewers assessed the level of evidence of relevant publications as high, moderate or low. 
Based on this, the evidence grade for diagnostic efficacy was rated as strong, moderately strong, limited or 
insufficient. The initial search revealed 784 articles listed in MEDLINE (Ovid), 1,034 in Scopus and 264 articles in the 
Web of Science. Only 17 articles met the inclusion criteria and were selected for qualitative synthesis. Results 
showed seven studies on the role of cephalometry in orthodontic treatment planning, eight concerning 
cephalometric measurements and landmark identification and two on cephalometric analysis. It is surprising that, 
notwithstanding the 968 articles published in peer-reviewed journals, scientific evidence on the usefulness of this 
radiographic technique in orthodontics is still lacking, with contradictory results. More rigorous research on a larger 
study population should be performed to achieve full evidence on this topic. 
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Review 

Introduction 

Since the introduction of lateral cephalometric radio- 
graphy in 1931 by Broadbent in the USA and by Hofrath 
in Germany, this radiograph and its related analyses have 
become a standard tool in orthodontic assessment and 
treatment planning [1-3]. Lateral cephalometric radio- 
graphs are systematically collected prior to orthodontic 
treatment in many European countries [3,4]. Neverthe- 
less, the real value of this imaging technique for diagno- 
sis and planning of orthodontic treatment remains 
uncertain [2-7]. Some authors showed that an adequate 
orthodontic diagnosis and treatment plan could not be 
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performed without comparing a lateral cephalometric 
radiograph before and after orthodontic treatment and 
that treating skeletal malocclusions without a cephalo- 
metric radiograph introduced serious errors [8]. While 
only a small percentage of the orthodontic treatment 
planning are modified based on lateral cephalometric 
radiographic analysis [6], it could adjust some aspects of 
treatment planning, such as tooth extraction, extract 
pattern and anchorage features [2,9] . 

The controversy about the correct use of the lateral 
cephalometric radiograph is also present in orthodontic 
textbooks where guidelines for orthodontic imaging are 
not expressed properly [10]. Several radiographic tech- 
niques, like panoramic and full-mouth periapical radio- 
graphs, used in orthodontics are found unproductive, 
since it provides duplicate information [4]. The latter is 
an important finding as the use of ionising radiation 
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should always be justified and kept 'as low as reasonably 
achievable' and definitely in children as radiographs are 
often performed at different time intervals during ortho- 
dontic treatment [11,12]. Even when there are means to 
optimise radiation dose of cephalometric radiographs, the 
primary issue is to justify the decision to take a lateral 
cephalogram prior to orthodontic treatment [10,11,13-15]. 

The present systematic review was initiated by the 
fact that three-dimensional (3D) cephalometric analysis 
is emerging, while there is still lack of scientific evi- 
dence on the validity and reliability of two-dimensional 
(2D) cephalometric imaging for orthodontic treatment 
planning [2,3]. 

Therefore, the aims of this study were to systematically 
review the available scientific literature and to evaluate the 
existing evidence about the validation of lateral cephalo- 
metric radiograph in orthodontics. This review also stud- 
ied the accuracy and reliability of lateral cephalograms and 
its cephalometric analysis. 

Materials and methods 
Information sources 

A comprehensive electronic database search to iden- 
tify relevant publications was conducted, and the 
reference lists in relevant articles were searched ma- 
nually for additional literature. We set no language 
limitations, although we did not attempt to explore 
the informally published literature: conference pro- 
ceedings and abstracts of research presented at con- 
ferences and dissertations. The following databases 
were searched: Ovid Medline (1946 to 11 January 
2012), Scopus (to 11 January 2012) and Web of Sci- 
ence (1899 to 11 January 2012). 

Search strategy 

We developed the search strategy with the help of an in- 
formation specialist at King's College London Dental 
Institute in London. The searches did not have a date 
limit and were not restricted to particular types of study 
design. The search strategy focused on the following 
terms: Cephalometr* and (orthodontic* or 'orthodontic 
treatment planning') and ('efficacy' or 'reproducibility' 
or 'repeatability' or 'reliability' or 'accuracy' or 'validity' 
or 'validation' or 'precision' or 'variability' or 'efficiency' 
or 'comparison') not ('Cone-Beam Computed Tomog- 
raphy' or 'Three-Dimensional imaging' or 'Cone Beam 
Computed Tomography' or 'Cone Beam CT' or 'Volu- 
metric Computed Tomography' or 'Volume Computed 
Tomography' or 'Volume CT' or 'Volumetric CT' or 
'Cone beam CT' or 'CBCT' or 'digital volume tomography' 
or 'DVT' or 'Spiral Computed Tomography' or 'Spiral 
Computer- Assisted Tomography' or 'Spiral Computerized 
Tomography' or 'spiral CT Scan' or 'spiral CT Scans' 
or 'Helical CT' or 'Helical CTS' or 'Helical Computed 



Tomography' or 'Spiral CAT Scan' or 'Spiral CAT Scans' 
or '3D' or '3-D' or 'three dimension*'). 

Study selection 

At the first stage, two reviewers (experienced dento- 
maxillofacial radiologists) independently screened the 
titles of the retrieved records, and only the titles re- 
lated to 2D cephalometry, radiographs for orthodontic 
treatment and tracings were included. Next, the ab- 
stracts of the retrieved publications were read by the 
two observers and categorised according to the study 
topic. An article had only to be justified by one obser- 
ver to be included for the second selection phase. All 
articles of interest in languages other than English 
were included. Of these two were included, one article 
was written in Portuguese and another in French. 
Eligibility of potential articles was determined by ap- 
plying the following inclusion criteria to the article 
abstracts: (1) technical efficacy, (2) diagnostic accur- 
acy efficacy, (3) diagnostic thinking efficacy, (4) thera- 
peutic efficacy, (5) patient outcome efficacy or any 
combination of the previous items as published by 
Fryback and Thornbury [16]. The other inclusion cri- 
teria were (1) accuracy, (2) reliability, (3) validity of 
lateral cephalometric radiograph, (4) landmark identi- 
fication on tracings (intra- and inter-observer errors) 
and (5) the effect of using 2D cephalometry on the 
orthodontic treatment plan. 
Diagnostic accuracy efficacy was defined as follows: 

1. Observer performance expressed as overall 
agreement, kappa index or correlation coefficients 

2. Diagnostic accuracy as percentage of correct 
landmark identification and further tracing analysis, 
validity and effectiveness of cephalometry in 
orthodontic treatment planning 

3. Sensitivity, specificity or predictive values of 
landmark identification 

Diagnostic thinking efficacy was defined as follows: 

1. Percentage of cases in a series in which images were 
judged 'helpful' for the diagnosis 

2. Difference in clinicians' subjective estimated 
diagnosis probabilities before and after evaluation of 
the cephalogram 

Therapeutic efficacy was defined as follows: 

1. Percentage of times the image was judged helpful in 
planning management of the patients in a case series 

2. Percentage of times therapy-planned pre- 
visualization of a lateral cephalogram needed to be 
changed after the image information was obtained 



Durao ef al. Progress in Orthodontics 2013, 14:31 
http://www.progressinorthodontics.eom/content/1 4/1 /3 1 



Page 3 of 1 1 



3. Percentage of times clinicians prospectively stated 
therapeutic choices needed to be changed after 
evaluating a cephalogram 

4. Whether different analyses lead to different 
decisions on treatment planning 

5. Intra- and inter-observer identification errors 

6. Reliability of landmark identification 

The analysis had to be based on primary materials 
or comprise a review on efficacy. When an abstract was 
considered by at least one author to be relevant, it was 
read in full text. At the second stage, the full texts were 
retrieved and critically examined. Reference lists of 
publications that had been found to be relevant in the 
first stage were hand-searched, and articles containing 
the words 'cephalometry', 'lateral cephalometric radio- 
graphy', together with 'treatment planning', 'orthodontic 
radiographs', 'landmark identification' and 'error' were 
selected. Book chapters and reviews were excluded 
since the aim of this systematic review was to evaluate 
primary studies. 

Data extraction 

Data was extracted with the aid of protocol 1 (Table SI 
in Additional file 1). It was established by reading the 
relevant literature on how to critically evaluate studies 
about diagnostic methods. To minimise bias, two ob- 
servers independently evaluated the quality and validity 
of original studies according to the quality assessment of 
diagnostic accuracy studies tool using protocol 2 (quality 
assessment of studies of diagnostic accuracy included in 
systematic reviews - QUADAS) (Table S2 in Additional 
file 1) [17]. When there was any disagreement con- 
cerning the relevance of an article, it was resolved by a 
discussion between the two reviewers. Each observer 
presented their arguments, and further discussion was 
held until a consensus was reached. Before the assessment, 
the protocols were tested for ten publications. A further 
five publications were read to calibrate the two reviewers 
regarding the criteria in protocol 2. Only publications that 
were found to be relevant to the reviewer in both proto- 
cols 1 (diagnostic efficacy) and 2 (level of evidence) were 
ultimately included. The quality and internal validity (level 
of evidence) of each publication was judged to be high, 
moderate or low according to the criteria in the following 
subsection [18]. 

Levels of evidence and criteria for evidence synttiesis 

High level of evidence A study was classified with high 
level of evidence if it fulfilled all of the following criteria: 

• There was an independent blind comparison 
between test and reference methods. 



• The population was described so that the status, 
prevalence and severity of the condition were clear. 
The spectrum of patients was similar to the 
spectrum of patients on whom the test method will 
be applied in clinical practice. 

• The results of the test method being evaluated did 
not influence the decision to perform the reference 
method(s). 

• Test and reference methods were well described 
concerning technique and implementation. 

• The judgments (observations and measurements) 
were well described considering diagnostic criteria 
applied and information and instructions to the 
observers. 

• The reproducibility of the test method was 
described for one observer (intra-observer 
performance) as well as for several (minimum 3) 
observers (inter-observer performance). 

• The results were presented in terms of relevant data 
needed for necessary calculations. 

Moderate level of evidence A study was assessed to 
have a moderate level of evidence if any of the above cri- 
teria were not met. On the other hand, the study was 
assessed not to have deficits that are described below for 
studies with a low level of evidence. 

Low level of evidence A study was assessed to have a 
low level of evidence if it met any of the following criteria: 

• The evaluation of the test and reference methods 
was nonindependent. 

• The population was not clearly described, and the 
spectrum of patients was distorted. 

• The results of the test method influenced the 
decision to perform the reference method. 

• The test or the reference method or both were not 
satisfactorily described. 

• The judgments were not well described. 

• The reproducibility of the test method was not 
described or was described for only one observer. 

• The results could have a systematic bias. 

• The results were not presented in a way that 
allowed efficacy calculations to be made. 

Rating conclusions according to evidence grade 

The scientific evidence of a conclusion on diagnostic effi- 
cacy was judged to be strong, moderately strong, limited 
or insufficient depending on the quality and internal vali- 
dity (level of evidence) of the publications assessed [18,19]: 

• Strong research-based evidence: at least two of the 
publications or a systematic review must have a 
high-level of evidence. 
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• Moderately strong research-based evidence: one of 
the publications must have a high level of evidence 
and two more of the publications must have a 
moderate level of evidence. 

• Limited research-based evidence: at least two of the 
publications must have a moderate level of evidence. 

• Insufficient research-based evidence: scientific 
evidence is insufficient or lacking according to the 
criteria defined in the present study. 

Synthesis of evidence 

The results of this review were described narratively. 
No meta-analyses were attempted because of lack of 
original studies. 

Results 

The number of articles reviewed in each phase to per- 
form this systematic review is presented in the PRISMA 
flow diagram (Figure 1) [20]. The initial search revealed 
784 articles listed in Medline (Ovid), 1,034 in Scopus 
and 264 articles in the Web of Science. The second stage 
of the search protocol was to retrieve the reference lists 
of the selected articles, which yielded 14 additional arti- 
cles of interest. After excluding 1,128 duplicates, 968 ar- 
ticles remained for review. In the first phase selection, 
the observers screened the articles by reading titles and 



Records identified 
through database 
searching 
(n =2082) 



Additional records 
identified through other 
sources 
(n=15) 



Records after duplicates removed 
(n = 968) 



Records screened 
(n = 968) 



Records excluded 
(n =765) 



Full-text articles 
assessed for eligibUity 
(n=35) 



Full-text articles 
excluded, with 
reasons 
(n=19) 



Studies included in 
qualitative synthesis 
(n=17) 



Figure 1 Methodology followed in the article selection process 
(adapted from Moher et al. [20]). 



abstracts. Articles that were not eligible because of 
irrelevant aims and were not directly related to this 
systematic review were excluded, thus 203 articles 
remained for further reading. Thirty-five articles were 
assessed for eligibility. 

After screening all the articles using protocols 1 and 2, 
17 articles met the inclusion criteria and were selected 
for qualitative synthesis and appraised to present some 
level of evidence. All articles that remained after screen- 
ing passed the qualitative synthesis. 

These 17 articles were categorised by topics as follows: 
7 studies on the role of cephalometry on the orthodontic 
treatment planning, 8 studies on cephalometric mea- 
surements and landmark identification and 2 studies on 
cephalometric analysis. 

Role of cephalometry on the orthodontic 
treatment planning 

Seven articles related to the importance and contribu- 
tion of cephalometry to orthodontic treatment planning 
were found (Table 1). Six of the publications were found 
to have low levels of evidence [2-4,6,9,10] and one classi- 
fied as moderate level of evidence [7]. 

Cephalometric measurements and landmark identification 

Only eight articles were selected as eligible in this cat- 
egory (Table 2). Five publications presented a moderate 
level of evidence [21-25], while the other three were 
identified as having a low level of evidence [5,26,27] . 

Cephalometric analysis 

Two publications with low-level evidence were found 
[28,29]. The studies did not use any reference standards, 
and the number of observers was not stated. The study 
designs were also not clearly explained (Table 3). 

Discussion 

The validity, efficacy and contribution of cephalometry 
in orthodontic treatment planning remain questionable 
[2]. In 2002, 90% of orthodontists in the USA routinely 
performed cephalometric radiographs [3]. This system- 
atic review was performed to assess the validity and reli- 
ability of 2D lateral cephalometry used for orthodontic 
treatment planning as well as the errors that can occur 
on 2D tracing. Despite the abundant amount of articles 
found on lateral cephalometry (« = 968), it is surprising 
that the present systematic review could only identify 
very few studies (« = 16, 1.6%) on its validity and reli- 
ability. This finding underlines the need for the present 
study and is an important cross point, considering the 
fact that we are flooding into 3D cephalometric studies 
nowadays. Apart from our findings, 2D cephalometry has 
other specific limitations, such as orthognatic surgery, air- 
way and growth assessment and skeletal maturation. In 



Table 1 Publications related to the Importance and contribution of cephalometry on the orthodontic treatment planning 



3r o 

c 

T3 S, 



Authors 
(year) 

Silling 
et al. [91 



Bruks 
et al. [6] 



Pae et al. 
[7] 



Nijkamp 
et al. [3] 



Devereux 
et al. [2] 



Aim of the study 

Assess usefulness of 
cephalometric analysis 



Observers 



Subjects Design of the study 



24 orthodontists 6 patients 



Stratified random design: 12 
orthodontists analysed 6 patients with 
cephalograms and 12 orthodontists 
studied 6 patients without cephalogram 



Statistical method 



Not referred 



Evaluation of lateral 
cephalometric and 
panoramic radiography 



Examine the link between 
lateral cephalograms and 
occlusal trays 



Influence of lateral 
cephalometry on treatment 
plan 



Influence of lateral 
cephalometry on treatment 

plan 



4 dentists and 
senior 

orthodontist 



70 patients Clinical evaluations and treatment plan 
by 4 dentists: 



1. Study casts + photographs 

2. Adding radiographs 



15 orthodontists 80 patients Tl: casts evaluated; 12 (1 week later): 
casts + lateral cephalograms 



Descriptive statistics and 
statistical analyses with 
computer software. Kruskal-Wallis 
test to evaluate differences 
between groups 



Rash model, regression plots, 
two-way ANOVA, post hoc 
multiple comparison Bonferroni 
and paired f test 



10 post-graduate 48 patients Randomised crossover design - Tl: casts. Overall proportion of agreement 
trainees and T2 (1 month after): with latera 

4 orthodontists cephalometry and tracing, and T3 and T4 

(repeated after 1 and 2 months) 



14 orthodontists 6 patients 



3 groups: (a) no lateral cephalogram and 
tracings, (b) some with lateral 
cephalogram and tracings and (c) al 
with lateral cephalogram and tracings 



Chi-square and binary 
logistic regression 



Results according to authors 

Class I patient: disagreement on 
extractions, anchorage and growth 
potential decisions 

No need for lateral cephalometry, 
except for atypical class II division 1 
patients, by 4 orthodontists 

Anchorage problems SS between 
patients with and without lateral 
cephalogram 

mpact on diagnosis relating to the 
ordering sequence of cephalogram: 
first choice, 58%; second choice, 
73%; third choice, 80% 

93% of cases: same treatment plan 
before and after radiographic 
analysis 

Class II division 2 patients: 125 
extractions planned at Tl; 80 at T2 

A lateral cephalogram influenced 
degree of severity, but not the 
difficulty of treatment 

Consistency of treatment plan was 
NS between the use only of dental 
casts or with additional 
cephalometry 

nfluence of cephalometrics on 
orthodontic treatment planning: NS 

Treatment plan changed for 
extraction pattern (42.9%), 
anchorage reinforcement (24%) and 
decision to extract (1 9.7%) 

Class I patient: lateral cephalogram 
less times ordered. Only patients 
where treatment plan changed 
after its analysis 

NS impact of cephalometrics on 
treatment plan 



Level of 
evidence 

Low 



Low 



Moderate 



Low 



Low 



O NJ 

3 2 

ii 



Table 1 Publications related to the Importance and contribution of cephalometry on the orthodontic treatment planning (Continued) 



Atchison Determine quantitatively 
et al. [4] the diagnosis and 

treatment plan information 
after radiograph evaluation 



Atchison Identify selection criteria 
et al. [10] for ordering orthodontic 
radiographs 



39 orthodontists 6 patients 



39 orthodontists 



I patients 



A 2-h interview for diagnosis and 
treatment planning of 6 cases. Study 
cast, intra- and extra-oral photographs, 
tracing and clinical findings available. 

A radiograph only if judged helpful 



A 2-h interview for diagnosis and 
treatment planning of 6 cases. Study 
cast, intra- and extra-oral photographs, 
tracing and clinical findings available 



Analysis of variance with 
repeated measures and 
covariance, homogeneity value 
and descriptive statistics 



Not referred 



98% of cases: at least one of the Low 
radiographs unproductive 



3/4 of radiographs did not provide 
information to change diagnosis 
and treatment plan 

14.4% of radiographs ordered for Low 
skeletal relationship of the jaws 

Lateral cephalograms accounted for 
34% of required information 

26% of all ordered radiographs 
produced modifications on 
diagnosis or treatment plan 

Pretreatment lateral cephalogram 
required in all patients needing 
orthodontic treatment 



NS, non-significant. 



Table 2 Publications concerning landmark identification 



Aim of the study Observers Subjects Design of the study 



3r o 

c 

T3 S, 



Authors 
(year) 



Statistical method 



Results according to authors 



Level of 
evidence 



Baumrind 
and Frantz 

[21] 



Kvam and 
Krogstad [27] 



Haynes and 
Chau [22] 



Quantification of errors 
in landmarl< 
identification 

Effects of errors on 
angular and linear 
measurements 

Evaluation of 
measurements in 
lateral cephalograms. 

Assess influence of 
knowledge and impact 
of angular errors 



Evaluation of landmark 
identification on 
Delaire analysis 

Comparison with data 
of conventiona 
cephalometry 



5 observers 



1 8 observers 



20 lateral skull 
radiographs 



3 lateral skull 
radiographs 



Observer identified 16 cephalometric 
landmarks on a transparent plastic 
template 



Hand cephalometric analysis made by 
each participant, 8 angles measured 



Mean, standard 
deviation and standard 
errors 



Mean and standard 
deviation 



2 observers 



28 lateral skull 
radiographs 



Establish a co-ordinate system for 
measurement on tracings 

Radiographs were traced twice by 
each observer (3 to 4 weeks) 



Mean deviation 



Ahlqvist et al. Study the magnitude 1 observer A patient was Computer software designed to allow Measurement errors 



[26] 



Houston 
et al. [23] 



Kamoen et al. 
[24] 



of projection errors on 
measurements in 
cephalometry 

Study the effects of 
incorrect patient 
position on linear 
measurements 

Evaluate errors at 
various stages of 
measurements in 
cephalometric 
radiograph 



Determine errors 
involved in landmark 
identification and its 
consequence to 
treatment results 



modelled 



4 observers 



4 observers 



24 lateral 
cephalograms 



50 lateral 
cephalograms 



movement of model on the 3 axes. 
The magnitude of errors was studied 
by a diagram 



2 radiographs of the same patient 



Radiographs traced on acetate sheet 
by each observer at T1/T2 (1-week 
interval) 

Items studied: (1) accuracy of digitiser, 
(2) intra- and inter-observer digitising 
errors and (3) intra- and inter-observer 
tracing errors 



studied by a diagram 
with the relative length 
of distances between 
modelled landmarks 



Analysis of variance 



(1) Levene's test for 
homogeneity of 
variances, (2) one-way 
ANOVA and (3) Levene's 
test for homogeneity 



Least reliable landmarks: gonion and lower i Moderate 
ncisor apex 



16 out of 24 angular measurements: less Low 
variability in post-graduates than students 

In 7 measurements, no difference was observed 

Post-graduates' tracings used for diagnostic 
purposes 

Standard deviation of students greater than 
post-graduates 

Intra-observer: NS differences between values Moderate 
of T1 and T2 tracings 

Inter-observer: differences between the 
averaged mean values on tracings were NS for 
either x or y co-ordinates 

Less than 1% error on length measurements if Low 
head is rotated up to 5° 



Head rotated more than 5° the error is increased 



Error variance is small (radiograph and Moderate 
tracing) when compared with the variance 
among groups 

SNA has a higher tracing variance than 
SNB due to the difficulty to identify point A 

(1) NS variances of co-ordinates for landmark at Moderate 

different positions on the digitiser. (2) NS intra- 

and inter-observer differences in digitisation. (3) 

S differences in landmarks and in the same 

landmark on different cephalograms and 

between observers 



O M 

3 2 

ii 



Table 2 Publications concerning landmark identification (Continued) 



Tng et al. [25] Evaluate the validity of 1 observer 2 lateral 



Bourriau 
et al. [5] 



dental and skeletal 
landmarks. Effect on 
angles and distances. 



Analyse the influence 
of film-object distance 
and type of receptor 
on landmark 
identification 



cephalograms 
of 30 dry 
skulls 



53 4 lateral 

orthodontists cephalograms 
of the same 
patient 



Steel balls placed in 15 dental and 
skeletal landmarks 

Two radiographs taken with and 
without the markers and digitised. 
Measurements compared 



19 cephalometric landmarks on each 
film 



2 radiographs performed at an 
equipment with a 4-m arm and 2 in a 
1.50-m arm equipment with 2 
different imaging receptors (digital 
and indirect digital) 



Mean and standard 
deviation 



Mean 



7 out of 10 skeletal and 5 dental landmarks Moderate 
were NS (p < 0.05) 

4 angles (SNA-SN/MnP, MxP/MnP and LI/MnP) 
and 3 distances (N-Me, MxP-Me and Lie to APg) 
were invalid (p < 0.05) 

Major errors in angles with dental landmarks 

NS difference between 2 imaging receptors Low 
neither between 2 cephalograms achieved by 2 
equipments (p > 0.99) 

Results obtained by cephalometric analysis was 
judged: 'very important' for 20.5%, 'important' 
for 70%, 'less important' for 8% and 'accessory' 
for 1 participant 



NS, non-significant; S, significant. 
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Table 3 Publications on cephalometric analysis 



Authors Aim of the study Observers Subjects Design of the Statistical Results according to authors Level of 

(year) study method evidence 







Kl(~it rf^tf^rrf^M 

1 L 1 CI CI 1 CCl 


1 29 




Not 


^ r^i it r^f \A/ith <:imi ar 1 r^\A/ 
J UUL Ul LJI t..aiCj VVILll Jllllllal LUVV 




unanimity for different 




patients 


based on Rickel^s, 


referred 


diagnosis, n 23 cases, 4 ana yses 




ceplnslometric analyses 






Steiner, Cervera and 




achieved simi ar diagnosis. In 13 










Coutand 




cases, 3 different diagnoses were 










cephalometric 




obtained. In 8 cases, the 










analyses 




diagnosis was different for class II 














and class II 


Abdullah 


Examine accuracy and 


Different 


275 


Radiographs traced 


Paired r 


The predicted change in LI Low 


et al. [29] 


precision of Steiner analysis 


orthodontists 


patients 


and analysed by 


test, mean 


(lower incisor) to NB was 




for clianges on ANB angle, 


(not 




orthodontists 


and 


underestimated by 0.8 mm. Only 




the Pg-NB distance and upper 


reference to 




according to the 


standard 


the prediction for pogonion and 




and lower incisor positions 


the number) 




Steiner analysis 


deviation 


NB showed improvement of the 










Radiographs at the 




precision (30%) 










end of treatment 














(T2) were traced by 














one observer 







order to be included in this systematic review, publications 
had to satisfy pre-defined methodological criteria. Two 
protocols were used regarding the search strategy, one 
based on diagnostic methods and the second based on the 
QUAD AS tool [17]. The 'levels of evidence' for assessing 
the quality and internal quality of each publication in- 
cluded in this review - how well the study was designed, 
how reliable its results appeared to be and the extent to 
which it addressed the questions posed - were modified 
according to the Oxford Center for Evidence-Based Medi- 
cine levels of evidence for diagnostic methods (CBEM) 
[18]. Only publications assessed to present a high or mod- 
erate level of evidence can form the basis for any scientific 
conclusions. Ten articles were identified as low level of 
evidence, five had moderate level and only one showed 
high level of evidence. 

All retrieved articles, assessing the importance and 
contribution of lateral cephalometric radiograph in 
orthodontic treatment, concluded that there is no sig- 
nificant difference on treatment planning decision with 
or without the evaluation of the lateral cephalogram. 
However, it should be considered that the suitable stud- 
ies in this review were based on small samples rather 
than large cohorts representing the entire population. In 
one study, the sample used was restricted (six patients) 
[2]. Furthermore, the short time lapse between observa- 
tions in some studies did not allow a full washout effect, 
which could lead to the repetition of the results [4,7,10]. 
The latter bias is further strengthened by the fact that 
recognition factors were often included, e.g. the possibil- 
ity of identifying patient by photographic visualisation as 
part of the examination. On the other hand, in one 
paper, only dental casts were presented to the observers, 
which might also lead to error since it does not mimic 
the clinical situation. Sample bias is also suspected based 
on the fact that selection of subjects is often poorly de- 
scribed or unclear [2,6,9], like the questions made to the 



observers that were not stated by any questionnaire [6], 
and in one article, observers were forced to choose yes/ 
no answers, which again do not perfectly simulate the 
reality [3]. 

In the two articles by Atchison et al. there was the 
possibility to identify patients as well as sample size was 
very restricted (six patients). There was no repetition of 
the questionnaire to test the variability between answers 
[4,10]. When it comes to the validity and reliability of 
cephalometric analysis, several errors should be consid- 
ered: landmark identification, tracing and measuring, 
and magnification of certain anatomical structures. 

Landmarks placed in anatomically formed edges are 
easier to identify, while some landmarks placed on 
curves are more prone to error. The gonion and lower 
incisor apex are the least consistent landmarks [21]. Fur- 
thermore, landmarks such as point A have a higher vari- 
ance than others like point B because of wider variation 
and anatomical localisation of point A [23]. Dental land- 
marks tend to have poorer validity than skeletal land- 
marks. Also, when landmarks are located on a curve like 
point A, point B or pogonion, the error is larger [25]. 
The evidence shows that landmark identification is a 
great source of error in 2D lateral cephalometry [24]. 
Major errors in angles with dental landmarks may occur 
[25] . In addition, different levels of knowledge and expe- 
riences between the observers also lead to varying re- 
sults on landmark identification. In a study using 18 
observers, in which 13 were dental students and 5 were 
post-graduate students, post-graduate's revealed lower 
intra-observer tracing variance than dental students [27]. 
Patient positioning during the procedure is also very im- 
portant to avoid errors on measurements and landmark 
identification [23,26]. The publication of Ahlqvist et al. 
[26] was assessed with a low level of evidence because 
there was only one observer. A similar classification oc- 
curred for Bourriau et al. [5], intra-observer agreement 
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could not be evaluated and the number of radiographs 
(n = 4) used was very low. Kvam and Krogstad's [27] 
publication also used a limited number of subjects (n = 
3). The choice of the observers also plays an important 
role on the results. Eighteen observers, in which 13 were 
dental and 5 were post-graduate students, participated 
in their study [27]. The latter can also bias results be- 
cause of the distinct level of education and expertise due 
to the lack of experience of the observers. 

Regarding the influence of magnification, Bourriau et al. 
[5] could not identify significant differences between 
equipment with a 4-m distant cephalometric machine and 
a 1.5-m distant cephalometric arm. Despite that, it should 
be considered that distance varying between the X-ray 
source and the image receptor will always cause a degree 
of magnification, the larger the distance, the lower the 
magnification. A focus object distance of 4 m in 2D cepha- 
lometric equipment is usually favoured for the reduced 
radiation burden and lack of enlargement, while an equip- 
ment with 1.5-m arm has a direct advantage of being com- 
pact and integrated in a multimodal system as well as 
having an increased resolution. On the other hand, pano- 
ramic equipment with a cephalometric arm at a 1.5-m dis- 
tance may present shortcomings in enlargement factors 
and superimposition of the bilateral structures more dis- 
tant from the midsagittal plane, considering the less mag- 
nified structures on the side nearby the image receptor 
[30] . We were not able to identify studies correlating land- 
mark identification errors in lateral cephalograms and 
their influence on the outcome of patient treatment. 

Finally, in 1982, De Abreu showed that different 2D 
cephalometric analysis may lead to different diagnosis of 
the same patient, varying the diagnosis between class II 
and class III in 8 out of 129 cases [28]. Also, Abdullah 
et al. [29] found that Steiner's cephalometric analysis is 
not accurate enough to plan orthodontic treatment. Both 
publications were assessed with low levels of evidence. 
In both publications, the number of observers was not 
referred. Furthermore, the statistical method used was 
not mentioned in [28]. 

The accuracy in the evaluation of the results, as well 
as producing changes in the treatment compared with 
clinical evaluation, seems to be one of the major bene- 
fits of 2D cephalometry. Risk-benefit analysis should be 
carefully evaluated. 

Conclusion 

The existing literature suggested that lateral cephalometric 
radiographs have been used without adequate scientific 
evidence of its usefulness and are often used prior to treat- 
ment. There is a need for diagnostic accuracy studies on 
2D lateral cephalometric radiograph where standardised 
methodological criteria for diagnostic thinking efficacy 
and therapeutic efficacy are incorporated. This systematic 



review has shown that the evidence to agree or disagree 
on the usefulness of this radiographic technique in ortho- 
dontics today is limited. Lateral cephalograms are used in 
many occasions for reasons other than clinical diagnosis 
or treatment, such as medico-legal reasons in a teaching 
environment or due to a lack of experience in the field. 
These conclusions are rather worrying. The use of radiation 
in children should be even better justified, and scientific 
evidence of that justification seems lacking. At present, 
there is a need for further studies on larger patient po- 
pulations, focusing on the therapeutic efficacy of lateral 
cephalograms. 
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